Thai Expressive Speech Processing Technology: A Review
نویسنده
چکیده
Problem statement: The studies on Thai expressive speech or emotional speech have been conducted for years. Most of them are expected to analysis the characteristics of Thai expressive speech. However, the conclusive reviews on these studies have not been conducted for further study on the speech technology or application of Thai expressive speech. Approach: The review of research on Thai expressive speech in various aspects has been performed. They include an analysis of fundamental frequency contours using Fujisaki’s model, an analysis of fundamental frequency contours using structural model and speech compression with noisy environments. It has been noted that four speaking emotions include enjoyable, sad, angry and reading styles. Results: A comparison of two successful F0 models has been reviewed. One approach is based on the Fujisaki’s model which has been applied for many tonal and toneless languages. Another one is based on the structural model which has been conducted primarily for Mandarin Chinese. Moreover, a study of speech compression for noise-corrupted Thai expressive speech by using two coding methods of CS-ACELP and MPCELP has been summarized. Conclusion: From the study, it can be seen that two mathematical models have been successfully applied to model the fundamental frequency contour of Thai expressive speech. As for speech compression, it can be seen that coding methods, types of noise, levels of noise, speech gender influence on the coding speech quality.
منابع مشابه
Corrigendum to: "Thai speech processing technology: A review" [Speech Communication 49 (1) (2007) 8-27]
In the first paragraph of Section 3.1.1, the sentence ''The first paper presenting an analysis of Thai tones from the age of ancient Thailand, Siam, was written by Bradley Bradley (1911). He illustrated the frequency curves of the five tones for Thai long syllables, similar to Fig. 1, " was not correctly cited from the reference: Bradley, C.B., 1911. Graphic analysis of the tone-accents of Siam...
متن کاملAnalytical Study on Fundamental Frequency Contours of Thai Expressive Speech Using Fujisaki’s Model
Problem statement: In spontaneous speech communication, prosody is an important factor that must be taken into account, since the prosody effects on not only the naturalness but also the intelligibility of speech. Focusing on synthesis of Thai expressive speech, a number of systems has been developed for years. However, the expressive speech with various speaking styles has not been accomplishe...
متن کاملPersian Adaptation of Enhanced Milieu Teaching for Iranian Children With Expressive Language Delay
Objectives: This study aimed at adapting and examining the applicability of the Teach-Model-Coach-Review model of the enhanced milieu teaching (EMT) approach for improving Iranian mothers’ language strategies while interacting with their toddlers with expressive language delay. Methods: In a single-subject multiple-baseline across-behavior study, the mothers of 3 toddlers with expressive langu...
متن کاملExpressive speech synthesis: a review
The objective of the present work is to provide a detailed review of expressive speech synthesis (ESS). Among various approaches for ESS, the present paper focuses the development of ESS systems by explicit control. In this approach, the ESS is achieved by modifying the parameters of the neutral speech which is synthesized from the text. The present paper reviews the works addressing various is...
متن کاملModeling of Fundamental Frequency Contour of Thai Expressive Speech using Fujisaki’s Model and Structural Model
Problem statement: In spontaneous speech communication, prosody is an important factor that must be taken into account, since the prosody effects on not only the naturalness but also the intelligibility of speech. Focusing on synthesis of Thai expressive speech, a number of systems has been developed for years. However, the expressive speech with various speaking styles has not been accomplishe...
متن کامل